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Abstract 

We consider the following general scheduling problem: The input consists of n jobs, each with 
an arbitrary release time, size, and a monotone function specifying the cost incurred when the job is 
completed at a particular time. The objective is to find a preemptive schedule of minimum aggregate 
cost. This problem formulation is general enough to include many natural scheduling objectives, such as 
weighted flow, weighted tardiness, and sum of flow squared. 

The main contribution of this paper is a randomized polynomial-time algorithm with an approxima- 
tion ratio 0(log log nP), where P is the maximum job size. We also give an O(l) approximation in the 
special case when all jobs have identical release times. Initially, we show how to reduce this scheduling 
problem to a particular geometric set-cover problem. We then consider a natural linear programming 
formulation of this geometric set-cover problem, strengthened by adding knapsack cover inequalities, 
and show that rounding the solution of this linear program can be reduced to other particular geometric 
set-cover problems. We then develop algorithms for these sub-problems using the local ratio technique, 
and Varadarajan's quasi-uniform sampling technique. 

This general algorithmic approach improves the best known approximation ratios by at least an expo- 
nential factor (and much more in some cases) for essentially all of the nontrivial common special cases 
of this problem. We believe that this geometric interpretation of scheduling is of independent interest. 

1 Introduction 

We consider the following general offline scheduling problem: 

General Scheduling Problem (GSP): The input consists of a collection of n jobs, and for each job j 
a positive integer release time rj, a positive integer size pj, and a cost or weight function Wj(t) > for 
each t > rj (we are purposely not precise about how these weight functions are represented in the input). 
Jobs are to be scheduled preemptively on one processor after their release times. If job j completes at 
time t, then a cost of Y^s=r-+i w j(t) * s incurred. The scheduling objective is to minimize the total cost, 

Sj=i E s =r-+l w j(t)> where Cj is the completion time of job j. 

This general problem generalizes several natural scheduling problems, for example: 

Weighted Flow Time: If Wj(t) = Wj, where wj is some fixed weight associated with job j, then the 
objective is weighted flow time. 

Flow Time Squared: If Wj(t) = 2(t — rj) — 1, then the objective is the sum of the squares of the flow 
times. 
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Weighted Tardiness: If Wj(t) = for t not greater than some deadline dj, and Wj(t) = Wj for t greater 
than dj , then the objective is weighted tardiness. 

In general, this problem formulation can model any cost objective function that is the sum of arbitrary 
cost functions for individual jobs, provided these cost functions are non-decreasing, i.e. it cannot hurt to 
finish a job earlier. 

Flow time, which is the duration of time Cj — Vj that a job is in the system, is clearly the most natural 
and most commonly used quality of service measure for a job in the computer systems literature. Many 
commonly-used and commonly-studied scheduling objectives are based on combining the flow times of the 
individual jobs. However, flow time is also considered a rather difficult measure to work with mathemati- 
cally. One reason for this is that even slight perturbations to the instance, can lead to lead to large changes in 
the optimum value. Despite much interest, large gaps remain in our understanding for even basic flow time 
based scheduling objectives. For example, for weighted flow time, the best known approximation ratios 
achievable by polynomial-time algorithms are essentially no better than the poly-logarithmic competitive 
ratios achievable by online algorithms. For weighted tardiness, and flow time squared, no nontrivial approx- 
imation ratios were previously known to be achievable. While in contrast, for all of these three problems, 
even the possibility of a polynomial time approximation scheme (PTAS) has not been ruled out. We discuss 
the related previous work further in Section [T31 

1.1 Our Results 

The main contribution of this paper is the design and analysis of a randomized 0(log log nP)-approximation 
algorithm for GSP, where P is the maximum job size. In the special case when all the release times are 0, 
we obtain an 0(l)-approximation algorithm. Let W = max^t Wj(t) be the maximum value attained by any 
weight function. The running time of our algorithm is polynomial in re, log P and log W, provided that we 
can in polynomial time determine the times when a weight function doubles. This is polynomial in the input 
size if the input must contain an explicit representation of the largest possible weight. 

The primary insight to obtain these results is to view the scheduling problem geometrically. The initial 
step is to show that GSP can be reduced (with only a constant factor loss in the approximation ratio) to the 
following geometric set-cover problem that we call R2C: 

Definition of the R2C Problem: The input consists of a collection of V points in two dimensional space, 
and for each point p G V an associated positive integer demand d p . Each point p € V is specified by 
its coordinates (x p , y p ). Further the input contains a collection 1Z of axis-parallel rectangles, each of them 
abutting the y-axis. That is, each rectangle r G 1Z has the form (0, x r ) x {y].,ifi). In addition, each rectangle 
r € 1Z has an associated positive integer capacity c r and positive integer weight w r . The goal is to find a 
minimum weight subset S C 1Z of rectangles, such that for each point p G V, the total capacity of rectangles 
covering p is at least d p , that is, Ylren-p&Tl °r — dp- 

As we shall see later, job sizes will be mapped to rectangle capacities in our reduction, so we will 
also use P to denote the largest capacity of any rectangle. Our algorithm for R2C starts with the natural 
linear programming (LP) relaxation of the problem, strengthened by adding the so-called knapsack cover 
inequalities. To round this LP solution, our algorithm then proceeds in a way that is by now standard (see 
for example lPT2l ) in the applications of knapsack cover inequalities. In the terminology of lfl2l . we reduce 
the problem to rounding an LP solution for the so-called priority set cover version of the problem and in 
addition several set multi-cover problems. These resulting problems are simpler as they are uncapacitated. 
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In particular we proceed as follows. The algorithm first picks rectangles that are selected by the LP 
solution to a significant extent (i.e. x r > (3, for some fixed constant /?), and then considers the residual 
solution. The knapsack cover inequalities guarantee that remaining LP variables for a feasible solution to 
the residual instance. Since all variables x r < (3 in this solution, the capacities and demands can be rounded 
to powers of 2, and the variables can be scaled by a constant factor, so that each point's demand is covered 
several times over. 

Points are then classified as heavy or light depending on whether or not the optimal LP solution exten- 
sively covers the point with rectangles whose capacity is larger than the demand of the point. We reduce 
the problem of covering the heavy points by rectangles with higher capacity to the geometric cover problem 
R3U defined below. We show that the instances of R3U that we obtain have boundaries with low union 
complexity. In particular, the boundary of the union of any k objects has a complexity of 0(k log P). Using 
Varadarajan's quasi-uniform sampling technique [23] for approximating weighted set cover on geometric 
instances with low union complexity, one can obtain a covering that is an 0(log log nP)-approximation to 
fractional cover specified by the LP solution. 

Definition of the R3U Problem: The input consists of a collection of V points in three dimensional space. 
Each point p G V is specified by its coordinates (x p , y p , z p ). Further the input contains a collection 1Z of 
axis-parallel right cuboids each of them abutting the xy and yz coordinate planes. That is, each right cuboid 
r G 1Z has the form (0, x r ) x (y^,y^) x (0,z r ). In addition, each right cuboid r G 1Z has an associated 
positive integer weight w r . The goal is to find a minimum weight subset S C 1Z of cuboids such that each 
point p G V is covered by at least one cuboid. 

We reduce the problem of covering the light points to log P different instances, one for each possible 
job size, of the weighted geometric multi-cover problem R2M defined below. We then show how to use the 
local ratio technique to obtain a solution for each instance of R2M that is 0(log log nP) -approximate with 
the cost in the optimal LP solution for jobs of this size. Combining these solutions for various sizes implies 
a solution for covering all light points with cost 0(log log nP) times the LP cost. 

Definition of the R2M Problem: The input consists of a collection of V points in two dimensional space, 
and for each point p G V an associated positive integer demand d p . Each point p G V is specified by 
its coordinates (x p , y p ). Further the input contains a collection 1Z of axis-parallel rectangles, each of them 
abutting the y-axis. That is, each rectangle r G 1Z has the form (0, x r ) x (y^,y^). In addition, each rectangle 
r G TZ has an associated positive integer weight w r . The goal is to find a minimum weight subset S C 1Z of 
rectangles, such that for each point p G V, the number of rectangles covering p is at least d p . 

1.2 Identical Release Times 

In the instances of R2C that arise from our reduction from the general scheduling problem, in the special 
case of identical release times, all the points lie on a line, and the rectangles are one-dimensional intervals. 
This is precisely the generalized caching problem, for which a polynomial-time 4-approximation algorithm 
is known [5 ] (see also lfl2l . for a somewhat more systematic approach to it). Thus we conclude that there is 
a polynomial-time 0(l)-approximation algorithm for GSP when all release times are identical. 

1.3 Related Results 

Let us first consider weighted flow time. [2] gives an online algorithm that is 0(log W^-competitive, and 
a semi-online algorithm (which means that the parameters P and W must be known a priori to the online 
algorithm) that is 0(log nP) -competitive. 11151 gives a semi-online algorithm that is 0(log 2 P) -competitive. 
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These online algorithms also give the best known approximation ratios for polynomial time algorithms. lfT4l 
gives a (1 + e) -approximation algorithm that has running time n O((iogPiogW)/e >. Thus, this gives a quasi- 
polynomial time approximation scheme (QPTAS) when both P and W are polynomially bounded in n. 
Moreover, [14] also gives a QPTAS for the case when only one of either P or W is polynomially bounded 
in n. In the special case that the weights are the reciprocal of the job sizes, and hence the objective is average 
stretch/slow-down, then there is a polynomial time approximation scheme ll8l[T4l. 

It is also known that the algorithm highest density first is (1 + e)-speed 0(l)-competitive for weighted 
flow [7] and flow squared [4j. No other approximation guarantees are known for flow squared. An n — 1- 
approximation algorithm is known for weighted tardiness if all jobs are released at the same time |[T6l . and 
nothing seems to be known for arbitrary release dates. PTAS's are known with the additional restriction that 
there are only a constant number of deadlines lfT8l or if jobs have unit size [ 19 j. In general, there has been 
other extensive work on flow time related objectives and we refer the reader to ll22l for a survey. 

The goal in geometric set cover problems is to improve the O(logn) set-cover bound using geometric 
structure. This is an active area of research and various different techniques have been developed. However, 
until recently most of these techniques applied only to the unweighted case. A key idea is the connection 
between set covers and e-nets [9 ], where an e-net is a sub-collection of sets that covers all the points that 
lie in at least an e fraction of the input sets. For any geometric problem, existence of e-nets of size at most 
(l/e)<?(l/e) implies 0((7(OPT))-approximate solution for unweighted set cover |9). Thus, proving better 
bounds on sizes of e-nets (an active research of research is discrete geometry) directly gives improved guar- 
antees for unweighted set-cover. In a surprising result, ifTTIi related the guarantee for unweighted set-cover 
to the union complexity of sets. If particular, if the sets have union complexity 0(nh(n)), which roughly 
means that the number of points on the boundary of the union of any collection of n sets is 0(nh(n)), then 
one can obtain an 0(h(n)) approximation iTTTl . This was subsequently improved to 0(\og(h(n)) ll23l . In 
certain cases these results also extend to the unweighted multi-cover case 1131 . However, these techniques 
do not apply to weighted set cover problems: the problem is that these techniques may sample some sets 
with much higher probability than that specified by the LP relaxation. In a recent breakthrough, Varadara- 
jan gave a new quasi-uniform sampling technique 11241 that obtains a 2°( log * n ) log(/i(n)) approximation 
for weighted geometric set cover problems with union complexity 0(nh(n)). In fact his result gives an 
improved guarantee of 0(log h(n)) if h(n) grows with n (even very mildly such as log log • • • log n, where 
the log is iterated 0(1) times). 

Organization: The paper is organized as follows. In section |2]the reduction from GSP to R2C is given. 
In section [3] we give the LP formulation of R2C and explain the initial preprocessing of the LP solution. In 
section |4] we explain how to reduce part of the problem of rounding the LP solution to an instance of the 
R3U problem. In section [5] we explain how to reduce part of the problem of rounding the LP solution to an 
instance of the R2M problem. 

2 The Reduction from GSP to R2C 

Our goal in this section is to prove TheoremQ] We accomplish this by giving a reduction from GSP to R2C, 
and then showing that this reduction increases the objective value of the optimal solution by at most a factor 
of four (LemmaO, and that this reduction doesn't shrink the objective value of the optimal solution (Lemma 
&. 

Theorem 1. A polynomial-time a-approximation algorithm for R2C implies a polynomial-time 4a approx- 
imation algorithm for GSP. 
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Definition of the Reduction from GSP to R2C: From an arbitrary instance X of GSP, we explain how to 
create an instance X' of R2C. Considering X, we say that a time t > rj is of class k > 1 with respect to job 
j if the cost of finishing j at time t lies in [2 fc_1 , 2 k - 1], i.e. £* /=1 ^(t') € [2 fc_1 , 2 fc - 1]. We say that t is 
of class 0, if the cost of finishing j at t is 0. Let I k denote the (possibly empty) time interval of class k times 
with respect to job j. Let T denote the set of all points that are endpoints of the intervals of the form I k for 
some job j and class k. For each time interval X of the form X = [ii, £2). where t\ < £2 and t\, ti € T, we 
create a point p in X' with demand d p = max(0, P(X) — \X\) = max(0, P(X) — (£2 — h)), where P(X) 
denotes the total size of jobs that are released during X, i.e. P(X) = Ylj- r -e[ti t 2 ) Pr ^ or eacn J orj 3 * n ^ 
and k > 0, we create a rectangle R k = [0, rf] x I k in X' with capacity pj and weight 2 k — 1. We note that 
the rectangles R J ,R{,... corresponding to the same job are pairwise disjoint. 

Without loss of generality, we may assume that the time horizon is nP, otherwise the instance can 
be divided into disjoint non-interacting subsets. Thus the maximum cost for any job can be nPW, so k < 
min(nP, \og(nPW). This implies that we can assume that log W = 0(nP) and that \T\ = 0(n \og(nPW)), 
i.e. polynomial in the size of the input. Throughout the paper we will use m to denote the number of points 
in the R2C problem. Clearly, m = 0(\T\ 2 ). 

Lemma 2. If there is a feasible solution S toX with objective value v, then there there is a feasible solution 
S' to X' with objective value at most Av. 

Proof. For job j in X, let k(j) denote the class during which j finishes in S (i.e. k(j) is the smallest integer 
such that the cost incurred by j in S is < 2 k ^) — 1). Consider the solution S' obtained by choosing for 
each job j, the intervals ijj, . . . , iLjy Clearly, each job contributes at most Yli=o 2* — 1 < 2(2 k W — 1) < 

4 . 2 fc 0) _1 , i.e. at most 4 times its contribution to S, and hence the total cost of S' is at most 4 times the cost 
of S. 

It remains to show that S' is feasible, i.e. for any point p, the total capacity of rectangles covering p is 
at least d p . Suppose p corresponds to the time interval X = [ii, £2) from X. Let Jx denote the jobs that 
arrive during X. For each job j S Jx that completes after ti, there is exactly one rectangle R k that covers 
p. Since S is a feasible schedule, the total size of jobs in Jx that can complete during X itself cannot be 
more than \X\ = t2 — t\. Thus the jobs in Jx that do not complete during X must have a total size of at 
least P{Jx) — \X\, which is the covering requirement for p. □ 

Lemma 3. If there is a feasible solution S' to X' with objective value v', then there there is a feasible 
solution S toX with objective value at most v. 

Proof. For each job j, let h(j) denote the largest index such that the rectangle R 3 h ^ lies in S' . Let us set a 

deadline dj for j as the right end point of I 3 h ^y 

We claim that there is a schedule S that completes each job j by time dj. Consider the bipartite graph 
defined as follows: We have time slots 1, 2, . . . , T on the right. For each job j, we have pj vertices on the 
left, each of which is connected to vertices rj, ... ,dj — 1 on the right. By Hall's theorem, a feasible schedule 
exists if and only if for any time interval X, the total size of jobs that have both release times and deadlines 
in X is at most |X|. Moreover, it suffices to show such a result for intervals X of the form [r a , df,), for some 
jobs a and b. Equivalently, for any such time interval X, the jobs j € Jx that are released during X and 
have dj after the end of X, have a total size of at least P(Jx) — \X\. 

Note that by the definition of T, then there is a point p in X' that corresponds to the interval X. Then by 
the feasibility of S', the total capacity of rectangles covering p in S' is at least P(Jx) — \X\. And as all of 



5 



these rectangles correspond to different jobs in X (the rectangles corresponding to the same job are pairwise 
disjoint), we are done. 

In S the cost of j is at most 2 h ^ — 1, since by the definition of the rectangle R{ the cost of finishing 
a job by deadline dj is at most 2 h ^ — 1. Now, the cost incurred by j in I' is at least 2 h w — 1 (since the 
rectangle R 3 h ^ already has cost 2 h ^ — 1). This implies that the cost of S is at most that of S'. □ 

Identical Release times: Without loss generality, let r,- = for all j. In this case, the above reduction 
become simpler. In particular, the first dimension corresponding to release time becomes irrelevant and we 
obtain the following problem. For each job j and k > 0, there is an interval l{ corresponding to class k 
times with respect to j and has capacity pj and weight 2 k — 1. All relevant intervals X are of the form 
[0, t] for t € T and have demand Jx — \X\ = D — t, where D is the total size of all the jobs. For each 
such X = [0, t), we introduce a point t with demand d t = D — t. The goal is to find a minimum weight 
subcollection of intervals I{ such that covers the demand. This is a special case of the following Generalized 
Caching Problem. 

Generalized Caching Problem: The input consists of a set of demands d(t) at various time steps t = 
1, . . . , n. In addition there is a collection of time intervals I, where each interval I El has weight wi, size 
c/ and span [sj, tj] with sj,tj 6 {1, . . . , n}. The goal is to find a minimum weight subset of intervals that 
covers the demand. That is, find the minimum weight subset of intervals SCI such that 

c i> d t Vt€{l,...,n}. 

ieSitelsuti] 

A 4-approximation for this problem was obtained by Bar-Noy et al. 0, based on the local-ratio tech- 
nique. Their algorithm can equivalently be viewed as a primal dual algorithm applied to a linear program 
with knapsack cover inequalities [6]. This immediately implies a 16-approximation for GSP in the case of 
identical release times. 



3 The LP Formulation for R2C 

The following is a natural integer programming formulation for R2C. For each rectangle r G 1Z there is an 
indicator variable x r specifying whether or not the rectangle r is selected. 

min w r x r s.t. 

reTl 

c r x r > dp \/p € V (1) 

r:pdr 

x r €{0,1} r ell (2) 

It is easily seen that the natural relaxation of this linear program, where x r £ {0, 1} is replaced by 
x r £ [0, 1], has a large integrality gap. In particular, this is true even when V consist of a single point, in 
which case the problem is equivalent to the knapsack cover problem ifTTl . Thus, we strengthen this LP by 
adding knapsack cover inequalities introduced in [ 1 1 ] have proved to be a useful tool to address capacitated 
covering problems fllUDl 1251 151 H21 
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This gives the the following linear program: 

min w r x r s.t. (3) 

rell 

min {c r , max(0, d p — c(S))} x r > 

r£ll\S:p£r 

d p - c{S) Vp G V, S C K (4) 
x r G [0, 1] Vr G ft (5) 

Here c(5) denotes the total capacity of rectangles in S. The constraints are valid for the following reason: 
For any subset S, even if all the items in S are chosen, at least a demand of d p — c(S) must be covered 
by remaining rectangles. Moreover, truncating an item size to the residual capacity does not affect the 
feasibility of an integral solution. Even though there are exponentially many constraints per point, a feasible 
(1 + e) -approximate solution, for any constant e > 0, can be found using the Ellipsoid algorithm, see iPTTll 
for details. Further only the cost incurs the (1 + e) factor loss, all the constraints are satisfied exactly. We 
will refer the inequalities in line (@]) as the knapsack cover inequalities. 

Let x be some (1 + e)-approximate feasible solution to the linear program for R2C in lines ©-©, and 
let OPT denote x's objective value. 

We now apply some relatively standard steps to simplify x. Let /3 be a small constant, /3 = 1/12 suffices. 
Let S denote the set of rectangles for which x r > ft. We pick all the rectangles in S, i.e. set x T = 1. Clearly, 
this cost of this set is at most 1//3 times the LP solution. 

For each point p, let S p = S n {r : r G lZ,p G r} denote the set of rectangles in S that cover p. Let us 
consider the residual instance, where the set of rectangles is restricted to TZ \ S and the demand of a point is 
dp — c(Sp). If dp — c(Sp) < 0, then p is already covered by S and we discard it. 

Since the solution x satisfied all the knapsack cover inequalities for each point p and set S, and hence in 
particular for every p and corresponding the set S p , we have that 

min {c r , dp — c(S p )} x r > d p — c(S p ) 

r£lZ\S p :p£r 

Henceforth, this is the only fact we will use about the solution x (in particular, we do not care that x satisfies 
several other inequalities for each point p). Let us scale the solution x restricted to 1Z \ S by 1//3 times. Call 
this solution x'. Note that since x r < f3, it still holds that x' r G [0, 1]. Clearly, x' satisfies 



E 



mm{c r ,d p — c(S p )}x' r > 



dp c(Sp 



P 

re^\5 p :pGr 

Let us define the new demand d' of p as d p — c(S p ) rounded up to the nearest integer power of 2. Similarly, 
defined a new capacity c', of each rectangle r to be c r rounded down to the nearest integer power of 2. x' 
still satisfies, 

d' 

min{ C ;,^}4>^ 

ren\S p :p& 

We call r a class i rectangle if c' r = 2*. Similarly, p is a class i point if d' p = 2*. We call a point p heavy 
if is covered by rectangles with class at least as high as that of p in the LP solution, more precisely if: 

^2 min(4, d' p )x' r > d' p . (6) 

r&l' :c' r >d' v 
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Equivalently, p is heavy if 

E x ' r — 

Otherwise we say that a point is //gfe. Thus a light point satisfies: 

£ «>(^-i)4 = (^)< m 

We now have different algorithms for covering heavy and light points. 



4 Covering Heavy Points 

In this section we show how reduce the problem of covering the heavy points by larger class rectangles 
to R3U. We then show that the resulting instances of R3U have low union complexity. In particular any 
k cuboids in a resulting R3U instance has union complexity 0{k log P). By Varadarajan's quasi-uniform 
sampling technique [23 ] this gives a solution that is an 2°( log * m ) log log P = 0(log log nP) approximation 
to the optimal fractional solution of this R3U instance. As x' gives a feasible fractional solution to this R3U 
instance, this means that the cost of cuboids that the algorithm selects is O(loglognP) approximate with 
OPT. 

The Problem of Covering the Heavy Points to R3U: The reduction takes as inputs the instance X' for 
heavy points obtained at the end of the previous section, and the LP solution x' and creates an instance A 
of R3U. For each heavy point p = (x, y) G X' with demand d' p , there is a point (x, y, d! ) in A. For each 
rectangle r = [0, x] x [2/1,2/2] in X' with capacity c' r , we define a right cuboid R r = [0, x] x [2/1,2/2] x [0, d r ] 
of weight w r . 

It is clear that there is a one to one correspondence between a covering of heavy points in X' by rectangles 
of no smaller class and a covering of the points in A by cuboids. Given a collection X of n geometric objects, 
the union complexity of X is number of edges in the arrangement of the boundary of X. For 3-dimensional 
objects, this is the total number of vertices, edges and faces on the boundary of X. In Lemma[4]and Lemma 
[5] we bound the union complexity of cuboids in A. 

Lemma 4. For any collection ofk rectangles of the type [0, r] x [s, t], the union complexity is 0(k). 

Proof. For each rectangle of the form [0, r) x [s,t] has a side touching the y-axis. Let us view of union 
of k such rectangles from (00, 0). Consider the vertical faces on the boundary of the union. For any two 
rectangles a and b, the pattern abab or baba cannot appear. Thus the vertical faces from a Davenport Schinzel 
sequence of order 2, which has size at most 2k — 1 (see for example Ell , chapter 7). Since the number of 
vertices is O(l) times the number of faces, the result follows. □ 

Lemma 5. The union complexity of any k cuboids in TZ is 0{k log P). 

Proof. This directly follows from lemma @] and noting that the number of distinct heights is O(logP). In 
particular, since the heights of powers of 2, consider the slice of the arrangement between z = 2 % and 
z = 2 l+l . This corresponds to union of rectangles of the form [0, r] x [s, t]. □ 

Remark: We remark that the bound in lemma [5] is tight for kind of cuboids we consider here. 
The following result is implicit in |[24l . 
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Theorem 6 ( 11241 ). There is a randomized polynomial-time algorithm that, given a weighted geometric set 
cover instance I where the union complexity of any k objects is k * g{k), produces an set cover of weight at 
most a factor of2°( log l J D log times the optimal fractional set cover. 

If the function g(n) grows even very mildly with n, say in particular that g(n) > log log • • • log n, where 
the log is iterated O(l) times, then the approximation guarantee above is 0(log <?(|/|)). 

Thus we can conclude that in polynomial time one find rectangles in the R2C instance X' that covers all 
the heavy points and that has weight at most 0(log log nP) times OPT. 



5 Covering Light Points 

In this section we show how to decompose the problem of covering the light points to log P instances of 
R2M, one instance P>i for each possible rectangle capacity class I. The decomposition ensures that an a 
approximation for R2M implies an cover for light points in I' with cost 0(a) times OPT. We then give 
an obtain an O(loglogra) = 0(log log nP) approximation for an R2M instance on m points. To do this, 
we relate the multi-cover problem to the set cover problem (where all demands are 1) and show that the set 
cover problem has a 2-approximation with respect to the fractional solution. This implies that the cost of 
rectangles that the algorithm selects for X' is O(loglogm) approximate with OPT. 

Remark: Better results for the R2M problem can be obtained by adapting Varadarajan's quasi-uniform 
sampling technique to multi-cover instances. However, we follow the simpler approach here since it suffices 
for our purposes. 

The Problem of Covering the Light Points to the instances ofRIM: The reduction takes as inputs the 
instance X 1 for R2C (restricted to light points), and the LP solution x' and for each I = 0, 1, 2, . . . creates 
an instance Bi of R2M. The points in Bi are the same as the points in X' . The demand of a point p in Bt is 
defined as d e p = [J2r-c'(r)=2 e x 'r\- The rectangles in Bi are precisely the class I rectangles in X' , i.e. those 
of capacity exactly 2 e . The weight of the rectangles in B^ are the the same as in X'. The goal is to cover 
each point p G B(_ by d e p distinct rectangles. 

Lemma 7. Consider the union S of the rectangles picked in the solutions Si to the instances £>£. Then S 
satisfies the demand of all the light points in X'. 

Proof. Consider a particular point p and suppose it lies in class i in X', i.e. its demand d'(p) = 2 l . Then the 
extent to which p is covered by \J e Se is at least 

Ki Ki r:c'(r)=2 e and per 

> £ x' r )-l) 

Ki r . c /( r ) =2 * and per 

a IE* E < 

\e<i r:c'(r)=2 l and per 

E 2 ' E *r)-d'(p) 

£<i r:c f(r)=2 t and per 
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where last inequality follows from ©. Since /3 = 1/12, it follows the each p is covered. □ 

Henceforth we focus on a particular instance of R2M. Let I be such an instance with n rectangles (sets) 
S\,...,S n and m points (elements) 1, . . . , m. Let di denote the covering requirement of i. We are given 
some fractional feasible solution x, i.e. for each i J2j-ies x i — ^ and Xj G [0, 1] for all Sj. The following 
lemma is standard. 

Lemma 8. For any multi-cover problem, at the loss of an O '(1) factor in approximation ratio, we can assume 
that the maximum demand d = maxj di is 0(log m). 

Proof. We pick each set Sj with probability min(l, 2xj). The expected cost of the sets picked is at most 
twice the LP cost. By standard Chernoff bounds, for some large enough constant c each element with 
demand di > c log m is covered with probability at least 1 — 1/ m 2 . In the residual instance, each uncovered 
element has demand 0(log m) and as xj < 1 for each set, the LP solution restricted to the unpicked sets is 
a feasible solution to the residual instance. □ 

The following lemma shows how a rounding procedure for a set cover problem can be used for corre- 
sponding multi-cover problem. 

Lemma 9. An LP-based a approximation algorithm for a weighted set cover problem can be used to obtain 
an a log d approximation for any multi-cover variant of the problem where d is the maximum demand of any 
element. 

Proof. Let x be some feasible fractional solution to the multi-cover problem. Our algorithm proceeds in 
d rounds, and picking some sets in each round such that after d rounds, each pi is covered by at least di 
distinct sets. Inductively, assume that at beginning of round r each element has an uncovered demand of at 
most d — r + 1. This is clearly true for r = 1. For round r = 1, . . . , d, we proceed as follows. Consider the 
LP solution y( r ) = x/(d — r + 1), restricted to the sets not chosen thus far in previous rounds. Let P r be 
the elements with (current) demand exactly d — r + 1. We claim that y^ r > is a feasible fractional set cover 
solution for P r . If % € P r had requirement di initially, then it has been covered a = di — (d — r + 1)) times 
thus far. As each xj < 1, the solution x restricted to sets not picked this far still covers i to extent di — Ci 
and hence y^ must cover i fractionally to extent at least (di — Ci)/(d — r + 1) > 1. 

Let C r denote the cover for P r obtained by applying our set cover rounding procedure to yv). We return 
the solution C% U . . . U C^. In this solution, each element i is covered at least di times, and its cost is 
Ylr=l a ' cost(y( r )) < Yl r =l a ' cost(x/(d — r + 1)) = alogd ■ cost(x). □ 

We now give a 2 approximation for R2M using local ratio. We refer the reader to |5l for a general 
description of the technique. While we use local ratio below, our approximation can be easily made LP- 
based using the equivalence between local ratio and the primal dual method [6]. 

Lemma 10. There is a 2-approximation for the R2M problem when all the demands are O(l). 

Proof. The algorithm is a straight-forward application of local ratio rule. We adopt the notation from all 
the local ratio rule papers. Let w be the original weight function. Consider the rightmost point p to be 
covered, that is the point p with maximum x coordinate (if there are several, pick one arbitrarily). Let z be 
the minimum weight of a rectangle covering p. Define the weight function w\ = z for rectangles that cover 
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p, and for the other rectangles. Let W2 = w — w\ be the residual weight function. Recall that the local ratio 
rule tentatively picks all the sets X with W2 weight 0, removes the covered points and proceeds recursively 
on the residual instance with function W2- Let 52 be the solution obtained recursively by the local ratio for 
the residual instance. We then add all the rectangles in X and perform the greedy-delete step, i.e. remove 
them arbitrarily as long as solution is feasible. 

As p must be covered, any optimum solution must incur a w\ cost of z. It suffices to show that at most 
two rectangles with non-zero w\ weight can be picked by the algorithm. Suppose more than two are left 
after the delete step. But as p is the rightmost point, any rectangle that covers p and is different from the one 
with the topmost edge or the one with the bottommost edge will be redundant. □ 
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